Weasels, Hedges and Peacocks: Discourse-level Uncertainty in Wikipedia Articles

نویسنده

  • Veronika Vincze
چکیده

Uncertainty is an important linguistic phenomenon that is relevant in many areas of language processing. While earlier research mostly concentrated on the semantic aspects of uncertainty, here we focus on discourseand pragmaticsrelated aspects of uncertainty. We present a classification of such linguistic phenomena and introduce a corpus of Wikipedia articles in which the presented types of discourse-level uncertainty – weasel, hedge and peacock – have been manually annotated. We also discuss some experimental results on discourse-level uncer-

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding Hedges by Chasing Weasels: Hedge Detection Using Wikipedia Tags and Shallow Linguistic Features

We investigate the automatic detection of sentences containing linguistic hedges using corpus statistics and syntactic patterns. We take Wikipedia as an already annotated corpus using its tagged weasel words which mark sentences and phrases as non-factual. We evaluate the quality of Wikipedia as training data for hedge detection, as well as shallow linguistic features.

متن کامل

Analyzing Iran Daily and US Today in Terms of Meta-Discourse Elements

The role of using meta-discourse elements in writing, especially in research newspapers, is so important that their authors can convey certainty, doubt, and characteristics of the writers in their writings. There are different meta-discourse markers used by various authors in different branches; for example, hedges and boosters are the most important devices in writing. The meta-discourse eleme...

متن کامل

The Use of Hedging in Discussion Sections of Applied Linguistics Research Articles with Varied Research Methods

The discourse of the discussion in research articles is regarded to be of considerable significance—as in this section the findings are interpreted in light of previous research and the authors’ argumentations are put forward as a major contribution (see Hyland, 1999). For this reason, the content and structure of the discussion section have been explored in several studies; however, little att...

متن کامل

Hedges and Boosters in Academic Writing: Native vs. Non-Native Research Articles in Applied Linguistics and Engineering

The expression of doubt and certainty is crucial in academic writing where the authors have to distinguish opinion from fact and evaluate their assertions in acceptable and persuasive ways. Hedges and boosters are two strategies used for this purpose. Despite their importance in academic writing, we know little about how they are used in different disciplines and genres and how foreign language...

متن کامل

Advertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles

When emerging technologies such as Search Engine Marketing (SEM) face tasks that require human level intelligence, it is inevitable to use the knowledge repositories to endow the machine with the breadth of knowledge available to humans. Keyword suggestion for search engine advertising is an important problem for sponsored search and SEM that requires a goldmine repository of knowledge. A recen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013